Learning Mixed Multinomial Logit Model from Ordinal Data

نویسندگان

  • Sewoong Oh
  • Devavrat Shah
چکیده

Motivated by generating personalized recommendations using ordinal (or preference) data, we study the question of learning a mixture of MultiNomial Logit (MNL) model, a parameterized class of distributions over permutations, from partial ordinal or preference data (e.g. pair-wise comparisons). Despite its long standing importance across disciplines including social choice, operations research and revenue management, little is known about this question. In case of single MNL models (no mixture), computationally and statistically tractable learning from pair-wise comparisons is feasible. However, even learning mixture with two MNL components is infeasible in general. Given this state of affairs, we seek conditions under which it is feasible to learn the mixture model in both computationally and statistically efficient manner. We present a sufficient condition as well as an efficient algorithm for learning mixed MNL models from partial preferences/comparisons data. In particular, a mixture of r MNL components over n objects can be learnt using samples whose size scales polynomially in n and r (concretely, rn(log n), with r n when the model parameters are sufficiently incoherent). The algorithm has two phases: first, learn the pair-wise marginals for each component using tensor decomposition; second, learn the model parameters for each component using RANKCENTRALITY introduced by Negahban et al. In the process of proving these results, we obtain a generalization of existing analysis for tensor decomposition to a more realistic regime where only partial information about each sample is available.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multinomial logit random effects models

This article presents a general approach for logit random effects modelling of clustered ordinal and nominal responses. We review multinomial logit random effects models in a unified form as multivariate generalized linear mixed models. Maximum likelihood estimation utilizes adaptive Gauss–Hermite quadrature within a quasi-Newton maximization algorithm. For cases in which this is computationall...

متن کامل

Learning from Comparisons and Choices

When tracking user-specific online activities, each user’s preference is revealed in the form of choices and comparisons. For example, a user’s purchase history tracks her choices, i.e. which item was chosen among a subset of offerings. A user’s comparisons are observed either explicitly as in movie ratings or implicitly as in viewing times of news articles. Given such individualized ordinal da...

متن کامل

Data Augmentation, Frequentist Estimation, and the Bayesian Analysis of Multinomial Logit Models

This article introduces a generalization of Tanner and Wong’s data augmentation algorithm which can be used when the complete data posterior distribution cannot be directly sampled. The algorithm proposes parameter values based on complete data sampling distributions of convenient frequentist estimators which ignore some information in the complete data likelihood. The proposals are filtered us...

متن کامل

Multinomial logit models with implicit variable selection

Multinomial logit models which are most commonly used for the modeling of unordered multi-category responses are typically restricted to the use of few predictors. In the high-dimensional case maximum likelihood estimates frequently do not exist. In this paper we are developing a boosting technique called multinomBoost that performs variable selection and fits the multinomial logit model also w...

متن کامل

R Package multgee: A Generalized Estimating Equations Solver for Multinomial Responses

This introduction to the R package multgee is a slightly modified version of ?, published in the Journal of Statistical Software. To cite multgee in publications, please use ?. To cite the GEE methodology implemeted in multgee, please use ?. The R package multgee implements the local odds ratios generalized estimating equations (GEE) approach proposed by ?, a GEE approach for correlated multino...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014